Application of Pattern Recognition 


in Arson Investigation 


DAVID J. ICOVE 

University of Maryland 

HEUBERT J. CRISMAN 

Prince George’s County Fire Department 


The authors discuss the application of pattern recognition methods 
to the development of techniques for aiding the arson investigator 
in determining trends in incendiary and suspicious fires. 


RECENT study! revealed that destructive fires in 1972 in the United 
States took the lives of 11,900 and caused $2.93 billion in property 
damage. The number of incendiary and suspicious fires alone totaled 84,200 
and resulted in nearly $286 million in property loss, representing increases 
of 16.8 and 22.6 percent respectively over the previous year. 

The detection and investigation of arson-related fires remains within 
the jurisdiction of fire prevention bureaus in most cities. In large metro- 
politan areas, such as Prince George’s County Maryland where this study 
originated, heavy caseloads prevent the detailed analysis of all investi- 
gations. 

In an effort to provide the comprehensive engineering analysis of these 
investigations, a set of algorithmic computer programs have been developed 
for the fire service using pattern recognition methods. The emergence of 
FEDAP,? the Fire Engineering Data Analysis Program, paralleled the 
implementation by Prince George’s County Fire Department of MIRS,! 
the Modular Information Reporting System. Designed by the National 
Bureau of Standards, MIRS is intended to become a nationwide standara 
for the computerized encoding of emergency incidents. 


PATTERN RECOGNITION APPROACH 


Successful implementation of a supervised arson pattern recognition 
(APR) system for the prediction and prevention of incendiary crimes! 
requires a set of valid decision rules. Using labeled pattern samples of 
known arson trends, highly reliable decision rules can be derived. 

The FEDAP arson pattern recognition strategy shown in Figures 1 and 
2 uses adaptive decision rules based upon both labeled and new pattern 
samples. The arson investigator serves as an integral part of the learning 
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Figure 1. An arson investigator 
controlling the arson recognition 
system at a remote computer ter- 
minal. 


system, and the performance of the system should improve with time. 


PATTERN SAMPLES 
Each pattern sample consists of an individual arson-related fire numeric- 
ally encoded by MIRS and defined by FEDAP as a partitioned row 
matrix. The 7’th fire is represented by a seven-dimensional pattern space 
G; = (Ga Gi... Gin)?, 


such that the features describing the 7’th fire include: 
Ga = Date of incident, Gi. = Location code, Gi; = Type of incident, 
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Figure 2. The FEDAP supervised arson pattern recognition system flowchart. 
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G;, = Act or omission, G;; = Time alarm received, Gis = Day of the week, 
and Gi; = Number of alarms. 
A collection of m pattern samples or incidents is represented by a par- 
titioned matrix 

G = (GG... Gn). 

Pattern samples used in this study consist of 361 incendiary and sus- 
picious fires collected from MIRS data for February through September 
1973.5 No labeled pattern samples based upon completed investigations 
were initially made available; however, several trends were later verified 
as valid labeled patterns. 


PaTTreRN CLASSIFICATION 

The process of deriving decision rules for pattern spaces is called 
pattern classification. Decision boundaries separate pattern classes 
through a specified decision rule. Since it is difficult to visualize patterns 
greater than three dimensions, pattern classifications are frequently handled 
in three, two or one dimension. 

One or more arson related fires occurring within a single location code is 
the decision rule for defining a data “cluster” C ;, where the subscript is the 
MIRS location code of the cluster. 

Based upon a two-dimensional pattern space displaying the total 
number of clusters within a prescribed area (X.) versus the total number of 
incidents within an individual cluster (X1), several pattern classes become 
evident. For the 361 new pattern samples, four decision boundaries are 
selected as indicated in Figure 3. 

These decision boundaries define five pattern classes S1, Sz, ... Ss as 
indicated with characteristic symbols. These new pattern samples represent 
the first training set used by FEDAP in the development of a valid set of 
decision rules. These characteristic pattern class symbols are used during 
feature selection. 


FEATURE SELECTION 

Feature selection is an acceptable technique for reducing the di- 
mensionality of the G — pattern space arson data. Often the process is 
multistage after several pattern classifications made by the arson in- 
vestigator as illustrated previously in Figure 2. 

The Prince George’s County Fire Department uses a map consisting 
of 185 major map grids, each containing 24 major grids. Using feature selec- 
tion, cluster classification and the FEDAP algorithms,? eleven adjacent 
major map grids have been isolated that contain 37 percent of the 361 
total arson related fires. 

A two-dimensional display of the geographical location of these clusters 
is shown in Figure 4 using the INMAP algorithm computer map. Notice 
that all of the five pattern classes are represented. 

Continuing the feature selection process, a better visual representation 
of the higher pattern classes is displayed in Figure 5 using the algorithm 
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Figure 3. Determination of decision rules for pattern classes. 


DENMAP, a computerized contour density map. Each class previously 
defined in Figure 3 as a characteristic class symbol is displayed in a five- 
level grey scaled contour map. 

The two highest adjacent clusers, C3344 and C4171, are chosen for further 
feature selections. When displaying the clusters in G — space by two di- 
mensions, several trends appear. Figure 6 is the pattern space formed by 


I 1 4] I 
if 1 3 l 7 
I 1 3 if 
i 27 #24 Dog 
tb 5 1 r 
I 1 oil 1 oT 
I 1 14 a 2 T 
i 4 I rata The Pr hniatione 
I Po 23 1 2? 1 t Gleplaying data clusters. 
I 2 1 1225-3 if 
I 14 BoE 2 2 5 T 
if 1. 5 1 2 2. +t 1 t 
I 11 1 Be at 
if 11 2 3.4 T 
I 2 1 if 
i I 


ye 
| 
j 
{ 
{ 
t 
t 
t 
1 
| 
4 
' 
1 
{ 
1 
) 
| 
! 
t 
i 
I 
| 
4 
{ 
| 
t 
! 
| 
{ 
i 
i 
} 
H 
' 
i 
{ 
{ 
t 
aK 


Pattern Recognition 39 


LEVEL 


sywens 2 


Figure 5. The DENMAP algorithm computer map displaying the data clusters using 
characteristic symbols. 


G;5 (time alarm received) and Gi; (date of incident), while Figure 7 is the 
pattern space formed by G;; and Gis (day of the week). 

In the preceding examples, the geographically displayed clusters are 
transformed through feature selection to appear as data sheets. These 
sheets can be used as decision rules by the investigator in the prediction 
of future arson attempts. With this tool the detection, prediction and 
prevention of incendiary crimes is within the reach of the investigator. 


EXPERIMENTAL RESULTS AND DISCUSSIONS 


The two adjacent clusters, C334, and C i171, reveal time, frequency and 
geographic trends. Verified by Prince George’s County Fire Department 
arson investigators, these cases in the clusters were both linked to a group of 
juveniles. Since these incidents are valid trends, they can become labeled 
pattern samples. Modification of the adaptive decision rules by the arson 
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Figure 7. The pattern space formed by features G,; and Gis. 
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investigator would provide for detection of future arson cases with similar 
patterns. 

In further review of the data, several decision rule generalizations can 
be made. Occurring during the daylight hours, the arsons formed minor 
time clusters corresponding to breaks in the school day (0800, 1200 and 
1500 Eastern Standard Time). 

All the cases for cluster C3344 were brush fires, which occurred further 
into the day as the summer months approached. In April, the first dwelling 
fire was set, accompanied by four more through the summer months, all 
within cluster C41 and reversing the trend as shown in Figure 6. 

With only one exception, all the fires occurred during the school week 
with Wednesday as the peak day. Thursday, Tuesday and Friday were the 
least frequent days for arsons as revealed by Figure 7. Fires set towards 
the end of the week tended to occur later in the daytime. 

Further investigations into astronomical correlations for these two 
clusters showed at 75 percent of the cases were within four days of the new 
(50 percent) or full (25 percent) moon. Sufficient data is not available at 
this time for further generalizations; however, new clusters are presently 
under investigation using the APR system. 


CONCLUSIONS 


Based on the experience gained in the development and use of the APR 
system, the following recommendations are suggested for improving the 
system: 


e Expand APR to statewide and national levels through the adoption 
of uniform incident reporting systems; 

¢ Initiate the investigation of correlations of meteorological and 
astronomical phenomenon to trends in incendiary crime; and 


e Isolate a valid set of decision rules with horizontal transferability 
to be placed in a national depository for participating APR users. 
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